This model is a Vision Transformer (ViT) fine-tuned on the Food-101 dataset and statically quantized to INT8 with Optimum. It is exported in OpenVINO Intermediate Representation (IR) format for efficient image classification inference.
Image Classification
Transformers
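
For readers unfamiliar with the term, static INT8 quantization fixes the quantization parameters ahead of time from a calibration set, rather than computing them at inference. The snippet below is a minimal pure-Python sketch of that idea using affine (scale and zero-point) quantization. It is not the Optimum or OpenVINO implementation, and the function names `calibrate`, `quantize`, and `dequantize` are hypothetical, chosen only for illustration.

```python
# Illustrative sketch of static (calibration-based) affine INT8 quantization.
# Assumption: all names here are hypothetical; this is not Optimum/OpenVINO code.

def calibrate(samples, qmin=-128, qmax=127):
    """Derive a fixed scale and zero point from calibration values."""
    # Extend the observed range to include 0.0 so zero maps to an exact integer.
    lo = min(min(samples), 0.0)
    hi = max(max(samples), 0.0)
    # Fall back to 1.0 if every calibration value is zero.
    scale = (hi - lo) / (qmax - qmin) or 1.0
    zero_point = round(qmin - lo / scale)
    return scale, zero_point


def quantize(x, scale, zero_point, qmin=-128, qmax=127):
    """Map a float to INT8, clipping values outside the calibrated range."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def dequantize(q, scale, zero_point):
    """Map an INT8 value back to an approximate float."""
    return (q - zero_point) * scale


# Parameters are computed once from calibration data, then reused at inference.
scale, zero_point = calibrate([-1.0, 0.5, 3.0])
approx = dequantize(quantize(0.5, scale, zero_point), scale, zero_point)
```

Because the parameters are frozen after calibration, inputs outside the calibrated range are clipped, which is why the calibration data should resemble the images seen at inference.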